EcoRI NheI

1 TAGAATTCGTAGGCTAGCATGCAGATCGAGCTGAGCACCTGCTTCTTCCTGTGCCTGCTGCGCTTCTGCTTC 

1 ► MetGlnIleGluLeuSerThrCysPhePheLeuCysLeuLeuArgPheCysPhe
73 AGCGCCACCCGCCGCTACTACCTGGGCGCCGTGGAGCTGAGCTGGGACTACATGCAGAGCGACCTGGGCGAG 
19 ► SerAlaThrArgArgTyrTyrLeuGlyAlaValGluLeuSerTrpAspTyrMetGlnSerAspLeuGlyGlu
145 CTGCCCGTGGACGCCCGCTTCCCCCCCCGCGTGCCCAAGAGCTTCCCCTTCAACACCAGCGTGGTGTACAAG 
43 ► LeuProValAspAlaArgPheProProArgValProLysSerPheProPheAsnThrSerValValTyrLys 
217 AAGACCCTGTTCGTGGAGTTCACCGACCACCTGTTCAACATCGCCAAGCCCCGCCCCCCCTGGATGGGCCTG
67 ► LysThrLeuPheValGluPheThrAspHisLeuPheAsnIleAlaLysProArgProProTrpMetGlyLeu
ApaI MscI
289 CTGGGCCCCACCATCCAGGCCGAGGTGTACGACACCGTGGTGATCACCCTGAAGAACATGGCCAGCCACCCC 
91 ► LeuGlyProThrIleGlnAlaGluValTyrAspThrValValIleThrLeuLysAsnMetAlaSerHisPro
361 GTGAGCCTGCACGCCGTGGGCGTGAGCTACTGGAAGGCCAGCGAGGGCGCCGAGTACGACGACCAGACCAGC 
115 ► ValSerLeuHisAlaValGlyValSerTyrTrpLysAlaSerGluGlyAlaGluTyrAspAspGlnThrSer
433 CAGCGCGAGAAGGAGGACGACAAGGTGTTCCCCGGCGGCAGCCACACCTACGTGTGGCAGGTGCTGAAGGAG 
139 ► GlnArgGluLysGluAspAspLysValPheProGlyGlySerHisThrTyrValTrpGlnValLeuLysGlu 
MscI PmlI

505 AACGGCCCCATGGCCAGCGACCCCCTGTGCCTGACCTACAGCTACCTGAGCCACGTGGACCTGGTGAAGGAC 
163 ► AsnGlyProMetAlaSerAspProLeuCysLeuThrTyrSerTyrLeuSerHisValAspLeuValLysAsp 

MscI

577 CTGAACAGCGGCCTGATCGGCGCCCTGCTGGTGTGCCGCGAGGGCAGCCTGGCCAAGGAGAAGACCCAGACC 
187 ► LeuAsnSerGlyLeuIleGlyAlaLeuLeuValCysArgGluGlySerLeuAlaLysGluLysThrGlnThr
649 CTGCACAAGTTCATCCTGCTGTTCGCCGTGTTCGACGAGGGCAAGAGCTGGCACAGCGAGACCAAGAACAGC 
211 ► LeuHisLysPheIleLeuLeuPheAlaValPheAspGluGlyLysSerTrpHisSerGluThrLysAsnSer
721 CTGATGCAGGACCGCGACGCCGCCAGCGCCCGCGCCTGGCCCAAGATGCACACCGTGAACGGCTACGTGAAC 
235 ► LeuMetGlnAspArgAspAlaAlaSerAlaArgAlaTrpProLysMetHisThrValAsnGlyTyrValAsn

PmlI

793 CGCAGCCTGCCCGGCCTGATCGGCTGCCACCGCAAGAGCGTGTACTGGCACGTGATCGGCATGGGCACCACC
259 ► ArgSerLeuProGlyLeuIleGlyCysHisArgLysSerValTyrTrpHisValIleGlyMetGlyThrThr
865 CCCGAGGTGCACAGCATCTTCCTGGAGGGCCACACCTTCCTGGTGCGCAACCACCGCCAGGCCAGCCTGGAG 
283 ► ProGluValHisSerIlePheLeuGluGlyHisThrPheLeuValArgAsnHisArgGlnAlaSerLeuGlu
937 ATCAGCCCCATCACCTTCCTGACCGCCCAGACCCTGCTGATGGACCTGGGCCAGTTCCTGCTGTTCTGCCAC 
307 ► IleSerProIleThrPheLeuThrAlaGlnThrLeuLeuMetAspLeuGlyGlnPheLeuLeuPheCysHis

1009 ATCAGCAGCCACCAGCACGACGGCATGGAGGCCTACGTGAAGGTGGACAGCTGCCCCGAGGAGCCCCAGCTG 
331 ► IleSerSerHisGlnHisAspGlyMetGluAlaTyrValLysValAspSerCysProGluGluProGlnLeu

1081 CGCATGAAGAACAACGAGGAGGCCGAGGACTACGACGACGACCTGACCGACAGCGAGATGGACGTGGTGCGC 
355 ► ArgMetLysAsnAsnGluGluAlaGluAspTyrAspAspAspLeuThrAspSerGluMetAspValValArg

(BglII/BamHI)

1153 TTCGACGACGACAACAGCCCCAGCTTCATCCAGATCCGCAGCGTGGCCAAGAAGCACCCCAAGACCTGGGTG 
379 ► PheAspAspAspAsnSerProSerPheIleGlnIleArgSerValAlaLysLysHisProLysThrTrpVal

1225 CACTACATCGCCGCCGAGGAGGAGGACTGGGACTACGCCCCCCTGGTGCTGGCCCCCGACGACCGCAGCTAC 
403 ► HisTyrIleAlaAlaGluGluGluAspTrpAspTyrAlaProLeuValLeuAlaProAspAspArgSerTyr

EagI

1297 AAGAGCCAGTACCTGAACAACGGCCCCCAGCGCATCGGCCGCAAGTACAAGAAGGTGCGCTTCATGGCCTAC 
427 ► LysSerGlnTyrLeuAsnAsnGlyProGlnArgIleGlyArgLysTyrLysLysValArgPheMetAlaTyr

ApaI

1369 ACCGACGAGACCTTCAAGACCCGCGAGGCCATCCAGCACGAGAGCGGCATCCTGGGCCCCCTGCTGTACGGC 
451 ► ThrAspGluThrPheLysThrArgGluAlaIleGlnHisGluSerGlyIleLeuGlyProLeuLeuTyrGly
1441 GAGGTGGGCGACACCCTGCTGATCATCTTCAAGAACCAGGCCAGCCGCCCCTACAACATCTACCCCCACGGC
475 ► GluValGlyAspThrLeuLeuIleIlePheLysAsnGlnAlaSerArgProTyrAsnIleTyrProHisGly

1513 ATCACCGACGTGCGCCCCCTGTACAGCCGCCGCCTGCCCAAGGGCGTGAAGCACCTGAAGGACTTCCCCATC 
499 ► IleThrAspValArgProLeuTyrSerArgArgLeuProLysGlyValLysHisLeuLysAspPheProIle
BglII

1585 CTGCCCGGCGAGATCTTCAAGTACAAGTGGACCGTGACCGTGGAGGACGGCCCCACCAAGAGCGACCCCCGC 
523 ► LeuProGlyGluIlePheLysTyrLysTrpThrValThrValGluAspGlyProThrLysSerAspProArg 

1657 TGCCTGACCCGCTACTACAGCAGCTTCGTGAACATGGAGCGCGACCTGGCCAGCGGCCTGATCGGCCCCCTG 
547 ► CysLeuThrArgTyrTyrSerSerPheValAsnMetGluArgAspLeuAlaSerGlyLeuIleGlyProLeu 

1729 CTGATCTGCTACAAGGAGAGCGTGGACCAGCGCGGCAACCAGATCATGAGCGACAAGCGCAACGTGATCCTG 
571 ► LeuIleCysTyrLysGluSerValAspGlnArgGlyAsnGlnIleMetSerAspLysArgAsnValIleLeu

KpnI

1801 TTCAGCGTGTTCGACGAGAACCGCAGCTGGTACCTGACCGAGAACATCCAGCGCTTCCTGCCCAACCCCGCC 

595 ► PheSerValPheAspGluAsnArgSerTrpTyrLeuThrGluAsnIleGlnArgPheLeuProAsnProAla
1873 GGCGTGCAGCTGGAGGACCCCGAGTTCCAGGCCAGCAACATCATGCACAGCATCAACGGCTACGTGTTCGAC 

619 ► GlyValGlnLeuGluAspProGluPheGlnAlaSerAsnIleMetHisSerIleAsnGlyTyrValPheAsp
1945 AGCCTGCAGCTGAGCGTGTGCCTGCACGAGGTGGCCTACTGGTACATCCTGAGCATCGGCGCCCAGACCGAC 

643 ► SerLeuGlnLeuSerValCysLeuHisGluValAlaTyrTrpTyrIleLeuSerIleGlyAlaGlnThrAsp
2017 TTCCTGAGCGTGTTCTTCAGCGGCTACACCTTCAAGCACAAGATGGTGTACGAGGACACCCTGACCCTGTTC 

667 ► PheLeuSerValPhePheSerGlyTyrThrPheLysHisLysMetValTyrGluAspThrLeuThrLeuPhe

BamHI 

2089 CCCTTCAGCGGCGAGACCGTGTTCATGAGCATGGAGAACCCCGGCCTGTGGATCCTGGGCTGCCACAACAGC 
691 ► ProPheSerGlyGluThrValPheMetSerMetGluAsnProGlyLeuTrpIleLeuGlyCysHisAsnSer

2161 GACTTCCGCAACCGCGGCATGACCGCCCTGCTGAAGGTGAGCAGCTGCGACAAGAACACCGGCGACTACTAC 
715 ► AspPheArgAsnArgGlyMetThrAlaLeuLeuLysValSerSerCysAspLysAsnThrGlyAspTyrTyr

2233 GAGGACAGCTACGAGGACATCAGCGCCTACCTGCTGAGCAAGAACAACGCCATCGAGCCCCGCCTGGAGGAG
739 ► GluAspSerTyrGluAspIleSerAlaTyrLeuLeuSerLysAsnAsnAlaIleGluProArgLeuGluGlu

BstXI 

2305 ATCACCCGCACCACCCTGCAGAGCGACCAGGAGGAGATCGACTACGACGACACCATCAGCGTGGAGATGAAG 
763 ► IleThrArgThrThrLeuGlnSerAspGlnGluGluIleAspTyrAspAspThrIleSerValGluMetLys

2377 AAGGAGGACTTCGACATCTACGACGAGGACGAGAACCAGAGCCCCCGCAGCTTCCAGAAGAAGACCCGCCAC 
787 ► LysGluAspPheAspIleTyrAspGluAspGluAsnGlnSerProArgSerPheGlnLysLysThrArgHis 

PmlI

2449 TACTTCATCGCCGCCGTGGAGCGCCTGTGGGACTACGGCATGAGCAGCAGCCCCCACGTGCTGCGCAACCGC 
811 ► TyrPheIleAlaAlaValGluArgLeuTrpAspTyrGlyMetSerSerSerProHisValLeuArgAsnArg

2521 GCCCAGAGCGGCAGCGTGCCCCAGTTCAAGAAGGTGGTGTTCCAGGAGTTCACCGACGGCAGCTTCACCCAG 
835 ► AlaGlnSerGlySerValProGlnPheLysLysValValPheGlnGluPheThrAspGlySerPheThrGln

ApaI

2593 CCCCTGTACCGCGGCGAGCTGAACGAGCACCTGGGCCTGCTGGGCCCCTACATCCGCGCCGAGGTGGAGGAC 
859 ► ProLeuTyrArgGlyGluLeuAsnGluHisLeuGlyLeuLeuGlyProTyrIleArgAlaGluValGluAsp
BstEII 

2665 AACATCATGGTGACCTTCCGCAACCAGGCCAGCCGCCCCTACAGCTTCTACAGCAGCCTGATCAGCTACGAG 
883 ► AsnIleMetValThrPheArgAsnGlnAlaSerArgProTyrSerPheTyrSerSerLeuIleSerTyrGlu

2737 GAGGACCAGCGCCAGGGCGCCGAGCCCCGCAAGAACTTCGTGAAGCCCAACGAGACCAAGACCTACTTCTGG 
907 ► GluAspGlnArgGlnGlyAlaGluProArgLysAsnPheValLysProAsnGluThrLysThrTyrPheTrp

2809 AAGGTGCAGCACCACATGGCCCCCACCAAGGACGAGTTCGACTGCAAGGCCTGGGCCTACTTCAGCGACGTG 
931 ► LysValGlnHisHisMetAlaProThrLysAspGluPheAspCysLysAlaTrpAlaTyrPheSerAspVal
2881 GACCTGGAGAAGGACGTGCACAGCGGCCTGATCGGCCCCCTGCTGGTGTGCCACACCAACACCCTGAACCCC
955 ► AspLeuGluLysAspValHisSerGlyLeuIleGlyProLeuLeuValCysHisThrAsnThrLeuAsnPro
EagI BstEII

2953 GCCCACGGCCGCCAGGTGACCGTGCAGGAGTTCGCCCTGTTCTTCACCATCTTCGACGAGACCAAGAGCTGG
979 ► AlaHisGlyArgGlnValThrValGlnGluPheAlaLeuPhePheThrIlePheAspGluThrLysSerTrp
3025 TACTTCACCGAGAACATGGAGCGCAACTGCCGCGCCCCCTGCAACATCCAGATGGAGGACCCCACCTTCAAG 
1003 ► TyrPheThrGluAsnMetGluArgAsnCysArgAlaProCysAsnIleGlnMetGluAspProThrPheLys
3097 GAGAACTACCGCTTCCACGCCATCAACGGCTACATCATGGACACCCTGCCCGGCCTGGTGATGGCCCAGGAC 
1027 ► GluAsnTyrArgPheHisAlaIleAsnGlyTyrIleMetAspThrLeuProGlyLeuValMetAlaGlnAsp

KpnI PmlI
3169 CAGCGCATCCGCTGGTACCTGCTGAGCATGGGCAGCAACGAGAACATCCACAGCATCCACTTCAGCGGCCAC 
1051 ► GlnArgIleArgTrpTyrLeuLeuSerMetGlySerAsnGluAsnIleHisSerIleHisPheSerGlyHis
3241 GTGTTCACCGTGCGCAAGAAGGAGGAGTACAAGATGGCCCTGTACAACCTGTACCCCGGCGTGTTCGAGACC
1075 ► ValPheThrValArgLysLysGluGluTyrLysMetAlaLeuTyrAsnLeuTyrProGlyValPheGluThr 
3313 GTGGAGATGCTGCCCAGCAAGGCCGGCATCTGGCGCGTGGAGTGCCTGATCGGCGAGCACCTGCACGCCGGC 
1099 ► ValGluMetLeuProSerLysAlaGlyIleTrpArgValGluCysLeuIleGlyGluHisLeuHisAlaGly
3385 ATGAGCACCCTGTTCCTGGTGTACAGCAACAAGTGCCAGACCCCCCTGGGCATGGCCAGCGGCCACATCCGC 
1123 ► MetSerThrLeuPheLeuValTyrSerAsnLysCysGlnThrProLeuGlyMetAlaSerGlyHisIleArg 

ApaI

3457 GACTTCCAGATCACCGCCAGCGGCCAGTACGGCCAGTGGGCCCCCAAGCTGGCCCGCCTGCACTACAGCGGC 
1147 ► AspPheGlnIleThrAlaSerGlyGlnTyrGlyGlnTrpAlaProLysLeuAlaArgLeuHisTyrSerGly
3529 AGCATCAACGCCTGGAGCACCAAGGAGCCCTTCAGCTGGATCAAGGTGGACCTGCTGGCCCCCATGATCATC 
1171 ► SerIleAsnAlaTrpSerThrLysGluProPheSerTrpIleLysValAspLeuLeuAlaProMetIleIle
3601 CACGGCATCAAGACCCAGGGCGCCCGCCAGAAGTTCAGCAGCCTGTACATCAGCCAGTTCATCATCATGTAC 
1195 ► HisGlyIleLysThrGlnGlyAlaArgGlnLysPheSerSerLeuTyrIleSerGlnPheIleIleMetTyr
3673 AGCCTGGACGGCAAGAAGTGGCAGACCTACCGCGGCAACAGCACCGGCACCCTGATGGTGTTCTTCGGCAAC 
1219 ► SerLeuAspGlyLysLysTrpGlnThrTyrArgGlyAsnSerThrGlyThrLeuMetValPhePheGlyAsn 

(SmaI/EcoRV)

3745 GTGGACAGCAGCGGCATCAAGCACAACATCTTCAACCCCCCCATCATCGCCCGCTACATCCGCCTGCACCCC 
1243 ► ValAspSerSerGlyIleLysHisAsnIlePheAsnProProIleIleAlaArgTyrIleArgLeuHisPro
3817 ACCCACTACAGCATCCGCAGCACCCTGCGCATGGAGCTGATGGGCTGCGACCTGAACAGCTGCAGCATGCCC
1267 ► ThrHisTyrSerIleArgSerThrLeuArgMetGluLeuMetGlyCysAspLeuAsnSerCysSerMetPro
3889 CTGGGCATGGAGAGCAAGGCCATCAGCGACGCCCAGATCACCGCCAGCAGCTACTTCACCAACATGTTCGCC 
1291 ► LeuGlyMetGluSerLysAlaIleSerAspAlaGlnIleThrAlaSerSerTyrPheThrAsnMetPheAla
3961 ACCTGGAGCCCCAGCAAGGCCCGCCTGCACCTGCAGGGCCGCAGCAACGCCTGGCGCCCCCAGGTGAACAAC 
1315 ► ThrTrpSerProSerLysAlaArgLeuHisLeuGlnGlyArgSerAsnAlaTrpArgProGlnValAsnAsn 

BstEII 

4033 CCCAAGGAGTGGCTGCAGGTGGACTTCCAGAAGACCATGAAGGTGACCGGCGTGACCACCCAGGGCGTGAAG 
1339 ► ProLysGluTrpLeuGlnValAspPheGlnLysThrMetLysValThrGlyValThrThrGlnGlyValLys
4105 AGCCTGCTGACCAGCATGTACGTGAAGGAGTTCCTGATCAGCAGCAGCCAGGACGGCCACCAGTGGACCCTG 
1363 ► SerLeuLeuThrSerMetTyrValLysGluPheLeuIleSerSerSerGlnAspGlyHisGlnTrpThrLeu
4177 TTCTTCCAGAACGGCAAGGTGAAGGTGTTCCAGGGCAACCAGGACAGCTTCACCCCCGTGGTGAACAGCCTG 
1387 ► PhePheGlnAsnGlyLysValLysValPheGlnGlyAsnGlnAspSerPheThrProValValAsnSerLeu 
4249 GACCCCCCCCTGCTGACCCGCTACCTGCGCATCCACCCCCAGAGCTGGGTGCACCAGATCGCCCTGCGCATG 
1411 ► AspProProLeuLeuThrArgTyrLeuArgIleHisProGlnSerTrpValHisGlnIleAlaLeuArgMet

SmaI HindIII

4321 GAGGTGCTGGGCTGCGAGGCCCAGGACCTGTACTAGCTGCCCGGGCTACAAGCTTT 
1435 ► GluValLeuGlyCysGluAlaGlnAspLeuTyr • • •
EcoRI NheI

1 TAGAATTCGTAGGCTAGCATGCAGATCGAGCTGAGCACCTGCTTCTTCCTGTGCCTGCTGCGCTTCTGCTTC 

1 ► MetGlnIleGluLeuSerThrCysPhePheLeuCysLeuLeuArgPheCysPhe
73 AGCGCCACCCGCCGCTACTACCTGGGCGCCGTGGAGCTGAGCTGGGACTACATGCAGAGCGACCTGGGCGAG 
19 ► SerAlaThrArgArgTyrTyrLeuGlyAlaValGluLeuSerTrpAspTyrMetGlnSerAspLeuGlyGlu
145 CTGCCCGTGGACGCCCGCTTCCCCCCCCGCGTGCCCAAGAGCTTCCCCTTCAACACCAGCGTGGTGTACAAG 
43 ► LeuProValAspAlaArgPheProProArgValProLysSerPheProPheAsnThrSerValValTyrLys 
217 AAGACCCTGTTCGTGGAGTTCACCGACCACCTGTTCAACATCGCCAAGCCCCGCCCCCCCTGGATGGGCCTG 
67 ► LysThrLeuPheValGluPheThrAspHisLeuPheAsnIleAlaLysProArgProProTrpMetGlyLeu

ApaI MscI

289 CTGGGCCCCACCATCCAGGCCGAGGTGTACGACACCGTGGTGATCACCCTGAAGAACATGGCCAGCCACCCC 
91 ► LeuGlyProThrIleGlnAlaGluValTyrAspThrValValIleThrLeuLysAsnMetAlaSerHisPro
361 GTGAGCCTGCACGCCGTGGGCGTGAGCTACTGGAAGGCCAGCGAGGGCGCCGAGTACGACGACCAGACCAGC 
115 ► ValSerLeuHisAlaValGlyValSerTyrTrpLysAlaSerGluGlyAlaGluTyrAspAspGlnThrSer
433 CAGCGCGAGAAGGAGGACGACAAGGTGTTCCCCGGGGGCAGCCACACCTACGTGTGGCAGGTGCTGAAGGAG 
139 ► GlnArgGluLysGluAspAspLysValPheProGlyGlySerHisThrTyrValTrpGlnValLeuLysGlu 

MscI PmlI
505 AACGGCCCCATGGCCAGCGACCCCCTGTGCCTGACCTACAGCTACCTGAGCCACGTGGACCTGGTGAAGGAC 
163 ► AsnGlyProMetAlaSerAspProLeuCysLeuThrTyrSerTyrLeuSerHisValAspLeuValLysAsp 

MscI

577 CTGAACAGCGGCCTGATCGGCGCCCTGCTGGTGTGCCGCGAGGGCAGCCTGGCCAAGGAGAAGACCCAGACC 
187 ► LeuAsnSerGlyLeuIleGlyAlaLeuLeuValCysArgGluGlySerLeuAlaLysGluLysThrGlnThr 
649 CTGCACAAGTTCATCCTGCTGTTCGCCGTGTTCGACGAGGGCAAGAGCTGGCACAGCGAGACCAAGAACAGC 
211 ► LeuHisLysPheIleLeuLeuPheAlaValPheAspGluGlyLysSerTrpHisSerGluThrLysAsnSer
721 CTGATGCAGGACCGCGACGCCGCCAGCGCCCGCGCCTGGCCCAAGATGCACACCGTGAACGGCTACGTGAAC 
235 ► LeuMetGlnAspArgAspAlaAlaSerAlaArgAlaTrpProLysMetHisThrValAsnGlyTyrValAsn

PmlI

793 CGCAGCCTGCCCGGCCTGATCGGCTGCCACCGCAAGAGCGTGTACTGGCACGTGATCGGCATGGGCACCACC
259 ► ArgSerLeuProGlyLeuIleGlyCysHisArgLysSerValTyrTrpHisValIleGlyMetGlyThrThr
865 CCCGAGGTGCACAGCATCTTCCTGGAGGGCCACACCTTCCTGGTGCGCAACCACCGCCAGGCCAGCCTGGAG 
283 ► ProGluValHisSerIlePheLeuGluGlyHisThrPheLeuValArgAsnHisArgGlnAlaSerLeuGlu
937 ATCAGCCCCATCACCTTCCTGACCGCCCAGACCCTGCTGATGGACCTGGGCCAGTTCCTGCTGTTCTGCCAC 
307 ► IleSerProIleThrPheLeuThrAlaGlnThrLeuLeuMetAspLeuGlyGlnPheLeuLeuPheCysHis

1009 ATCAGCAGCCACCAGCACGACGGCATGGAGGCCTACGTGAAGGTGGACAGCTGCCCCGAGGAGCCCCAGCTG 
331 ► IleSerSerHisGlnHisAspGlyMetGluAlaTyrValLysValAspSerCysProGluGluProGlnLeu

1081 CGCATGAAGAACAACGAGGAGGCCGAGGACTACGACGACGACCTGACCGACAGCGAGATGGACGTGGTGCGC
355 ► ArgMetLysAsnAsnGluGluAlaGluAspTyrAspAspAspLeuThrAspSerGluMetAspValValArg 

(BglII/BamHI)

1153 TTCGACGACGACAACAGCCCCAGCTTCATCCAGATCCGCAGCGTGGCCAAGAAGCACCCCAAGACCTGGGTG 
379 ► PheAspAspAspAsnSerProSerPheIleGlnIleArgSerValAlaLysLysHisProLysThrTrpVal

1225 CACTACATCGCCGCCGAGGAGGAGGACTGGGACTACGCCCCCCTGGTGCTGGCCCCCGACGACCGCAGCTAC 
403 ► HisTyrIleAlaAlaGluGluGluAspTrpAspTyrAlaProLeuValLeuAlaProAspAspArgSerTyr

EagI

1297 AAGAGCCAGTACCTGAACAACGGCCCCCAGCGCATCGGCCGCAAGTACAAGAAGGTGCGCTTCATGGCCTAC 
427 ► LysSerGlnTyrLeuAsnAsnGlyProGlnArgIleGlyArgLysTyrLysLysValArgPheMetAlaTyr

ApaI

1369 ACCGACGAGACCTTCAAGACCCGCGAGGCCATCCAGCACGAGAGCGGCATCCTGGGCCCCCTGCTGTACGGC 
451 ► ThrAspGluThrPheLysThrArgGluAlaIleGlnHisGluSerGlyIleLeuGlyProLeuLeuTyrGly
1441 GAGGTGGGCGACACCCTGCTGATCATCTTCAAGAACCAGGCCAGCCGCCCCTACAACATCTACCCCCACGGC
475 ► GluValGlyAspThrLeuLeuIleIlePheLysAsnGlnAlaSerArgProTyrAsnIleTyrProHisGly

1513 ATCACCGACGTGCGCCCCCTGTACAGCCGCCGCCTGCCCAAGGGCGTGAAGCACCTGAAGGACTTCCCCATC 
499 ► IleThrAspValArgProLeuTyrSerArgArgLeuProLysGlyValLysHisLeuLysAspPheProIle
BglII

1585 CTGCCCGGCGAGATCTTCAAGTACAAGTGGACCGTGACCGTGGAGGACGGCCCCACCAAGAGCGACCCCCGC 
523 ► LeuProGlyGluIlePheLysTyrLysTrpThrValThrValGluAspGlyProThrLysSerAspProArg 

1657 TGCCTGACCCGCTACTACAGCAGCTTCGTGAACATGGAGCGCGACCTGGCCAGCGGCCTGATCGGCCCCCTG 
547 ► CysLeuThrArgTyrTyrSerSerPheValAsnMetGluArgAspLeuAlaSerGlyLeuIleGlyProLeu

1729 CTGATCTGCTACAAGGAGAGCGTGGACCAGCGCGGCAACCAGATCATGAGCGACAAGCGCAACGTGATCCTG
571 ► LeuIleCysTyrLysGluSerValAspGlnArgGlyAsnGlnIleMetSerAspLysArgAsnValIleLeu

KpnI

1801 TTCAGCGTGTTCGACGAGAACCGCAGCTGGTACCTGACCGAGAACATCCAGCGCTTCCTGCCCAACCCCGCC 

595 ► PheSerValPheAspGluAsnArgSerTrpTyrLeuThrGluAsnIleGlnArgPheLeuProAsnProAla
1873 GGCGTGCAGCTGGAGGACCCCGAGTTCCAGGCCAGCAACATCATGCACAGCATCAACGGCTACGTGTTCGAC 

619 ► GlyValGlnLeuGluAspProGluPheGlnAlaSerAsnIleMetHisSerIleAsnGlyTyrValPheAsp
1945 AGCCTGCAGCTGAGCGTGTGCCTGCACGAGGTGGCCTACTGGTACATCCTGAGCATCGGCGCCCAGACCGAC 

643 ► SerLeuGlnLeuSerValCysLeuHisGluValAlaTyrTrpTyrIleLeuSerIleGlyAlaGlnThrAsp
2017 TTCCTGAGCGTGTTCTTCAGCGGCTACACCTTCAAGCACAAGATGGTGTACGAGGACACCCTGACCCTGTTC 

667 ► PheLeuSerValPhePheSerGlyTyrThrPheLysHisLysMetValTyrGluAspThrLeuThrLeuPhe

BamHI

2089 CCCTTCAGCGGCGAGACCGTGTTCATGAGCATGGAGAACCCCGGCCTGTGGATCCTGGGCTGCCACAACAGC 
691 ► ProPheSerGlyGluThrValPheMetSerMetGluAsnProGlyLeuTrpIleLeuGlyCysHisAsnSer

2161 GACTTCCGCAACCGCGGCATGACCGCCCTGCTGAAGGTGAGCAGCTGCGACAAGAACACCGGCGACTACTAC 
715 ► AspPheArgAsnArgGlyMetThrAlaLeuLeuLysValSerSerCysAspLysAsnThrGlyAspTyrTyr 

2233 GAGGACAGCTACGAGGACATCAGCGCCTACCTGCTGAGCAAGAACAACGCCATCGAGGCCCGCAGGCGC^GG 
739 ► GluAspSerTyrGluAspIleSerAlaTyrLeuLeuSerLysAsnAsnAlaIleGluProArgArgArgArg

BstXI 

2305 CGCGAGATCACCCGCACCACCCTGCAGAGCGACCAGGAGGAGATCGACTACGACGACACCATCAGCGTGGAG
763 ► ArgGluIleThrArgThrThrLeuGlnSerAspGlnGluGluIleAspTyrAspAspThrIleSerValGlu

2377 ATGAAGAAGGAGGACTTCGACATCTACGACGAGGACGAGAACCAGAGCCCCCGCAGCTTCCAGAAGAAGACC 
787 ► MetLysLysGluAspPheAspIleTyrAspGluAspGluAsnGlnSerProArgSerPheGlnLysLysThr 

PmlI

2449 CGCCACTACTTCATCGCCGCCGTGGAGCGCCTGTGGGACTACGGCATGAGCAGCAGCCCCCACGTGCTGCGC 
811 ► ArgHisTyrPheIleAlaAlaValGluArgLeuTrpAspTyrGlyMetSerSerSerProHisValLeuArg

2521 AACCGCGCCCAGAGCGGCAGCGTGCCCCAGTTCAAGAAGGTGGTGTTCCAGGAGTTCACCGACGGCAGCTTC 
835 ► AsnArgAlaGlnSerGlySerValProGlnPheLysLysValValPheGlnGluPheThrAspGlySerPhe

ApaI

2593 ACCCAGCCCCTGTACCGCGGCGAGCTGAACGAGCACCTGGGCCTGCTGGGCCCCTACATCCGCGCCGAGGTG 
859 ► ThrGlnProLeuTyrArgGlyGluLeuAsnGluHisLeuGlyLeuLeuGlyProTyrIleArgAlaGluVal

BstEII 

2665 GAGGACAACATCATGGTGACCTTCCGCAACCAGGCCAGCCGCCCCTACAGCTTCTACAGCAGCCTGATCAGC
883 ► GluAspAsnIleMetValThrPheArgAsnGlnAlaSerArgProTyrSerPheTyrSerSerLeuIleSer

2737 TACGAGGAGGACCAGCGCCAGGGCGCCGAGCCCCGCAAGAACTTCGTGAAGCCCAACGAGACCAAGACCTAC 
907 ► TyrGluGluAspGlnArgGlnGlyAlaGluProArgLysAsnPheValLysProAsnGluThrLysThrTyr 

2809 TTCTGGAAGGTGCAGCACCACATGGCCCCCACCAAGGACGAGTTCGACTGCAAGGCCTGGGCCTACTTCAGC 
931 ► PheTrpLysValGlnHisHisMetAlaProThrLysAspGluPheAspCysLysAlaTrpAlaTyrPheSer


FIG. 9B 


OPTIMIZED MESSENGER RNA 
09/407,605 
10278-009001 
REPLACEMENT SHEET 


2881 GACGTGGACCTGGAGAAGGACGTGCACAGCGGCCTGATCGGCCCCCTGCTGGTGTGCCACACCAACACCCTG
955 ► AspValAspLeuGluLysAspValHisSerGlyLeuIleGlyProLeuLeuValCysHisThrAsnThrLeu
EagI BstEII

2953 AACCCCGCCCACGGCCGCCAGGTGACCGTGCAGGAGTTCGCCCTGTTCTTCACCATCTTCGACGAGACCAAG 
979 ► AsnProAlaHisGlyArgGlnValThrValGlnGluPheAlaLeuPhePheThrIlePheAspGluThrLys
3025 AGCTGGTACTTCACCGAGAACATGGAGCGCAACTGCCGCGCCCCCTGCAACATCCAGATGGAGGACCCCACC 
1003 ► SerTrpTyrPheThrGluAsnMetGluArgAsnCysArgAlaProCysAsnIleGlnMetGluAspProThr
3097 TTCAAGGAGAACTACCGCTTCCACGCCATCAACGGCTACATCATGGACACCCTGCCCGGCCTGGTGATGGCC 
1027 ► PheLysGluAsnTyrArgPheHisAlaIleAsnGlyTyrIleMetAspThrLeuProGlyLeuValMetAla

KpnI

3169 CAGGACCAGCGCATCCGCTGGTACCTGCTGAGCATGGGCAGCAACGAGAACATCCACAGCATCCACTTCAGC
1051 ► GlnAspGlnArgIleArgTrpTyrLeuLeuSerMetGlySerAsnGluAsnIleHisSerIleHisPheSer

PmlI

3241 GGCCACGTGTTCACCGTGCGCAAGAAGGAGGAGTACAAGATGGCCCTGTACAACCTGTACCCCGGCGTGTTC 
1075 ► GlyHisValPheThrValArgLysLysGluGluTyrLysMetAlaLeuTyrAsnLeuTyrProGlyValPhe
3313 GAGACCGTGGAGATGCTGCCCAGCAAGGCCGGCATCTGGCGCGTGGAGTGCCTGATCGGCGAGCACCTGCAC 
1099 ► GluThrValGluMetLeuProSerLysAlaGlyIleTrpArgValGluCysLeuIleGlyGluHisLeuHis
3385 GCCGGCATGAGCACCCTGTTCCTGGTGTACAGCAACAAGTGCCAGACCCCCCTGGGCATGGCCAGCGGCCAC
1123 ► AlaGlyMetSerThrLeuPheLeuValTyrSerAsnLysCysGlnThrProLeuGlyMetAlaSerGlyHis 

ApaI

3457 ATCCGCGACTTCCAGATCACCGCCAGCGGCCAGTACGGCCAGTGGGCCCCCAAGCTGGCCCGCCTGCACTAC 
1147 ► IleArgAspPheGlnIleThrAlaSerGlyGlnTyrGlyGlnTrpAlaProLysLeuAlaArgLeuHisTyr
3529 AGCGGCAGCATCAACGCCTGGAGCACCAAGGAGCCCTTCAGCTGGATCAAGGTGGACCTGCTGGCCCCCATG 
1171 ► SerGlySerIleAsnAlaTrpSerThrLysGluProPheSerTrpIleLysValAspLeuLeuAlaProMet
3601 ATCATCCACGGCATCAAGACCCAGGGCGCCCGCCAGAAGTTCAGCAGCCTGTACATCAGCCAGTTCATCATC 
1195 ► IleIleHisGlyIleLysThrGlnGlyAlaArgGlnLysPheSerSerLeuTyrIleSerGlnPheIleIle
3673 ATGTACAGCCTGGACGGCAAGAAGTGGCAGACCTACCGCGGCAACAGCACCGGCACCCTGATGGTGTTCTTC 
1219 ► MetTyrSerLeuAspGlyLysLysTrpGlnThrTyrArgGlyAsnSerThrGlyThrLeuMetValPhePhe

(SmaI/EcoRV)

3745 GGCAACGTGGACAGCAGCGGCATCAAGCACAACATCTTCAACCCCCCCATCATCGCCCGCTACATCCGCCTG 
1243 ► GlyAsnValAspSerSerGlyIleLysHisAsnIlePheAsnProProIleIleAlaArgTyrIleArgLeu
3817 CACCCCACCCACTACAGCATCCGCAGCACCCTGCGCATGGAGCTGATGGGCTGCGACCTGAACAGCTGCAGC 
1267 ► HisProThrHisTyrSerIleArgSerThrLeuArgMetGluLeuMetGlyCysAspLeuAsnSerCysSer
3889 ATGCCCCTGGGCATGGAGAGCAAGGCCATCAGCGACGCCCAGATCACCGCCAGCAGCTACTTCACCAACATG 
1291 ► MetProLeuGlyMetGluSerLysAlaIleSerAspAlaGlnIleThrAlaSerSerTyrPheThrAsnMet
3961 TTCGCCACCTGGAGCCCCAGCAAGGCCCGCCTGCACCTGCAGGGCCGCAGCAACGCCTGGCGCCCCCAGGTG 
1315 ► PheAlaThrTrpSerProSerLysAlaArgLeuHisLeuGlnGlyArgSerAsnAlaTrpArgProGlnVal

BstEII 

4033 AACAACCCCAAGGAGTGGCTGCAGGTGGACTTCCAGAAGACCATGAAGGTGACCGGCGTGACCACCCAGGGC
1339 ► AsnAsnProLysGluTrpLeuGlnValAspPheGlnLysThrMetLysValThrGlyValThrThrGlnGly 
4105 GTGAAGAGCCTGCTGACCAGCATGTACGTGAAGGAGTTCCTGATCAGCAGCAGCCAGGACGGCCACCAGTGG 
1363 ► ValLysSerLeuLeuThrSerMetTyrValLysGluPheLeuIleSerSerSerGlnAspGlyHisGlnTrp 
4177 ACCCTGTTCTTCCAGAACGGCAAGGTGAAGGTGTTCCAGGGCAACCAGGACAGCTTCACCCCCGTGGTGAAC
1387 ► ThrLeuPhePheGlnAsnGlyLysValLysValPheGlnGlyAsnGlnAspSerPheThrProValValAsn 
4249 AGCCTGGACCCCCCCCTGCTGACCCGCTACCTGCGCATCCACCCCCAGAGCTGGGTGCACCAGATCGCCCTG 
1411 ► SerLeuAspProProLeuLeuThrArgTyrLeuArgIleHisProGlnSerTrpValHisGlnIleAlaLeu

SmaI HindIII

4321 CGCATGGAGGTGCTGGGCTGCGAGGCCCAGGACCTGTACTAGCTGCCCGGGCTACAAGCTTTAC 
1435 ► ArgMetGluValLeuGlyCysGluAlaGlnAspLeuTyr • • •
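
The pairing of each nucleotide row with its three-letter translation, and the placement of the enzyme labels, can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and the usual recognition sequences for a few of the labeled enzymes; the names `translate`, `find_sites`, and `row1` are illustrative and do not come from the figure.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), ONE_LETTER)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val",
}

# Recognition sequences for some enzymes labeled in the figure
# (only unambiguous six-base sites are listed here).
SITES = {
    "EcoRI": "GAATTC", "NheI": "GCTAGC", "ApaI": "GGGCCC", "MscI": "TGGCCA",
    "PmlI": "CACGTG", "EagI": "CGGCCG", "BglII": "AGATCT", "KpnI": "GGTACC",
    "BamHI": "GGATCC", "SmaI": "CCCGGG", "HindIII": "AAGCTT",
}


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TO_AA[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return "".join(residues)


def find_sites(dna):
    """Return {enzyme: [1-based start positions]} for every site present."""
    hits = {}
    for name, site in SITES.items():
        positions = [i + 1 for i in range(len(dna) - len(site) + 1)
                     if dna.startswith(site, i)]
        if positions:
            hits[name] = positions
    return hits


# First nucleotide row of the figure (positions 1-72).
row1 = "TAGAATTCGTAGGCTAGCATGCAGATCGAGCTGAGCACCTGCTTCTTCCTGTGCCTGCTGCGCTTCTGCTTC"
start = row1.find("ATG")  # the open reading frame begins at the first ATG
print(translate(row1[start:]))
print(find_sites(row1))
```

Run on the first row, the translation should reproduce the residue line printed beneath it, and the only sites found should be the two labeled above that row. The same check applied to other rows is a quick way to spot single-base transcription errors.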
Codon Frequency in Highly Expressed Human Genes

Codon  % occurrence    Codon  % occurrence    Codon  % occurrence

Glu                    Cys                    Gln
GAA    25              TGC    68              CAA    12
GAG    75              TGT    32              CAG    88

Arg                    Ala                    Gly
CGC    37              GCC    53              GGC    50
CGT     7              GCT    17              GGT    12
CGA     6              GCA    13              GGA    14
CGG    21              GCG    17              GGG    24
AGA    10
AGG    18

Leu                    Ser                    Pro
CTC    26              TCC    28              CCC    48
CTT     5              TCT    13              CCT    19
CTA     3              TCA     5              CCA    16
CTG    58              TCG     9              CCG    17
TTA     2              AGC    34
TTG     6              AGT    10
Codon Frequency in Highly Expressed Human Genes

Codon  % occurrence    Codon  % occurrence    Codon  % occurrence

Ile                    Thr                    Val
ATC    77              ACC    57              GTC    25
ATT    18              ACT    14              GTT     7
ATA     5              ACA    14              GTA     5
                       ACG    15              GTG    64

Tyr                    Phe                    Lys
TAC    74              TTC    80              AAA    18
TAT    26              TTT    20              AAG    82

Asn                    His
AAC    78              CAC    79
AAT    25              CAT    21
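
One way to read the two codon-frequency tables is as a lookup for the most frequent codon of each amino acid. The sketch below transcribes the percentages from the tables and back-translates a short peptide with that rule. It illustrates how the tables can be used and is not necessarily the procedure by which the sequences in the figures were designed; the names `CODON_USAGE`, `SINGLE_CODON`, `preferred_codon`, and `back_translate` are hypothetical.

```python
# Percent occurrence per codon, transcribed from the two tables above.
CODON_USAGE = {
    "Glu": {"GAA": 25, "GAG": 75},
    "Cys": {"TGC": 68, "TGT": 32},
    "Gln": {"CAA": 12, "CAG": 88},
    "Arg": {"CGC": 37, "CGT": 7, "CGA": 6, "CGG": 21, "AGA": 10, "AGG": 18},
    "Ala": {"GCC": 53, "GCT": 17, "GCA": 13, "GCG": 17},
    "Gly": {"GGC": 50, "GGT": 12, "GGA": 14, "GGG": 24},
    "Leu": {"CTC": 26, "CTT": 5, "CTA": 3, "CTG": 58, "TTA": 2, "TTG": 6},
    "Ser": {"TCC": 28, "TCT": 13, "TCA": 5, "TCG": 9, "AGC": 34, "AGT": 10},
    "Pro": {"CCC": 48, "CCT": 19, "CCA": 16, "CCG": 17},
    "Ile": {"ATC": 77, "ATT": 18, "ATA": 5},
    "Thr": {"ACC": 57, "ACT": 14, "ACA": 14, "ACG": 15},
    "Val": {"GTC": 25, "GTT": 7, "GTA": 5, "GTG": 64},
    "Tyr": {"TAC": 74, "TAT": 26},
    "Phe": {"TTC": 80, "TTT": 20},
    "Lys": {"AAA": 18, "AAG": 82},
    "Asn": {"AAC": 78, "AAT": 25},
    "His": {"CAC": 79, "CAT": 21},
}

# Assumption, not taken from the tables: Met and Trp each have a single
# codon in the standard genetic code, so no frequency choice is needed.
SINGLE_CODON = {"Met": "ATG", "Trp": "TGG"}


def preferred_codon(residue):
    """Return the most frequent codon listed for a three-letter residue name."""
    if residue in SINGLE_CODON:
        return SINGLE_CODON[residue]
    usage = CODON_USAGE[residue]
    return max(usage, key=usage.get)


def back_translate(residues):
    """Concatenate the preferred codon of each residue in order."""
    return "".join(preferred_codon(r) for r in residues)


peptide = ["Met", "Gln", "Ile", "Glu", "Leu", "Ser", "Thr", "Cys"]
print(back_translate(peptide))
```

For the first eight residues of the figures (MetGlnIleGluLeuSerThrCys), this rule yields the same bases that open the coding region in the nucleotide rows. Residues absent from the tables, such as Asp, would raise a `KeyError` in this sketch.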